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The availability of genome-scale ONA sequence information and reagents has>adically altered life-science 
research^,, revolut.on has led to the development of a new scientific subdiscipline derived from a combma- 
vzn of the f.elds of toxicology and genomic. This subdiscipline. termed toxicogenomics. is concerned w.th the 
,oent.f .cat.on of potential human and environmental toxicants, and their putative mechan.sms of action throuah 
the use of genom.es resources. One such resource is ONA microarrays or "chips/ which allow the monitoring of 
the express.on levels of thousands of genes simultaneously. Here we propose a general method by which oene 
expression, as measured by cDNA microarrays. can be used as a highly sensitive and informative marker for 
toxicty Our purpose is to acquaint the reader with the development and current state of microarray technol- 
".all 0 Pre$ent ° ur V " w 0 the usefu,ne » microarrays to the field of toxicology. Mo/. Carcinog. 24- fSJ- 
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INTRODUCTION 

Technological advancements combined with in- 
tensive DNA sequencing efforts have generated an 
enormous database of sequence information over the 
past decade. To date, more than 3 million sequences, 
totaling over 2.2 billion bases [I), are contained 
within the GenBank database, which includes the 
complete sequences of 19 different organisms (2|. The 
: :rst complete sequence of a free-living organism. 
Haemophilus influenzae, was reported in 1995 (3| and 
was followed shortly thereafter by the first complete 
sequence of a eukaryote. Saccharomyces cermiae |4|. 
The development of dramatically improved sequenc- 
ing methodologies promises that complete elucida- 
tion or tne Homo jjwens DNA sequence is not far 
benind 151 

To expioirmore ruilv the wealth or new sequence 
information, it was necessary to develop novel meth- 
ods for the high-throughput or parallel monitoring 
of gene expression. Established methods such as 
northern blotting, RNAse protection assays. SI nu- 
clease analysis, plaque hybridization, and slot blots 
do not provide sufficient throughput to effectively 
utilize the new genomics resources. Newer methods 
such as differential display (6). high-density filter 
hybridization [7.8], serial analysis of gene expression 
(9j, and cDNA- and oligonucleotide-based microarray 
"chip" hybridization (10-12) are possible solutions 
to this bottleneck, it is our belief that the microarray 
approach, which allows the monitoring of expres- 
sion levels of thousands of genes simultaneously, is 
a tool of unprecedented power for use in toxicology 
studies. 



Almost without exception, gene expression is al- 
tered during toxicity, as either a direct or indirect 
result of toxicant exposure. The challenge facing 
toxicologists is to define, under a given set of ex- 
perimental conditions, the characteristic and spe- 
cific pattern of gene expression elicited bv a given 
toxicant. Microarray technology offers an ideal plat- 
form for this type of analysis and could be the foun- 
dation for a fundamentally new approach to 
toxicology testing. 

MICROARRAY DEVELOPMENT AND APPLICATIONS 
cDNA Microarrays 

in the past several years, numerous systems were 
developed for the construction of large-scale DNA 
arravs. Ail or tnese platrorms are oasea on cDNAs 
or oligonucleotides immobilized to a solid sup- 
port, in the cDNA approach. cDNA (or genomic) 
clones of interest are arrayed in a multi-well for- 
mat and amplified by polymerase chain reaction. 
The products of this amplification, which are usu- 
ally 500- to 2000-bp clones from the 3' regions of 
the genes of interest, are then spotted onto solid 
support by using high-speed robotics. By using 
this method, microarrays of up to 10 000 clones 
can be generated by spotting onto a glass substrate 
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(13.141. Sample detection for microarrays on glass 
. involves the use of probes labeled with fluores- 
cent or radioactive nucleotides. 

Fluorescent cDNA probes are generated from con- 
trol and test RNA samples in single-round revene-tran- 
scnption reac tions in the presence of fluorescentiv 
tagged dLTP (e.g., Cy3-dLTP and CyS-dLTP). which 
produces control and test products labeled with dif- 
ferent rluors. The cDNAs generated from these two 
populations, collectively termed the "probe." are then 
mixed and hybridized to the array under a glass cov- 
- erslip (10.11.151; The fluorescent signal is detected 
by using a custom-designed scanning confocal mi- 
croscope equipped with a motorized stage and lasers 
tor fluor excitation (10.1 1.15]. The data are analyzed 
with custom digital image analysis software that de- 
termines tor each DNA feature the ratio of fluor 1 to 
fluor 2. corrected for local background (16.17). The 
strength of this approach lies in the ability to label 
RNAs from control and treated samples with differ- 
ent fluorescent nucleotides, allowing for the simul- 
taneous hybridization and detection of both 
populations on one microarray. This method elimi- 
nates the need to control for hybridization between 
arrays. The research groups of Drs. Patrick Brown and 
Ron Davis at Stanford University spearheaded the 
effort to develop this approach, which has been suc- 
cessfully applied to studies of Arabidopsis thaliana 
RNA (10] ( yeast genomic DNA (151, tumorigenic ver- 
sus non-tumorigenic human tumor cell lines (11], 
human T-cells (181, yeast RNA [191, and human in- 
flammatory disease-related genes (20). The most dra- 
matic result of this effort was the first published 
account or gene expression of an entire genome, that 
ot the yeast Saccharomvces crnisiae (21). 

in an alternative approach, large numbers of cDNA 
clones can be spotted onto a membrane support, al- 
beit at a lower density [7.221- This method is useful 
for expression profiling and large-scale screening and 
mapping or genomic or cDNA clones [7.22-24|. In 
expression proniinc on niter membranes, two uu- 
rerent membranes are used simuitaneousiv ror con- 
trol and test RNA hybridizations, or a single 
membrane is stripped and reprobed. The signal is 
detected by using radioactive nucleotides and visu- 
alized by phosphorimager analysis or autoradiogra- 
phy. Numerous companies now sell such cDNA 
membranes and software to analyze the image data 
125-27]. 

Oligonucleotide Microarrays 

Oligonucleotide miCToarrays are constructed either 
by spotting prefabricated oligos on a glass support 
(13) or by the more elegant method of direct in situ 
oiigo synthesis on the glass surface by photolithog- 
raphy [28-30]. The strength of this approach lies in 
its ability to discriminate DNA molecules based on 
single base-pair difference. This allows the applica- 
tion of this method to the fields of medical diagnos- 



tics, pharmacogenetics, and sequencing by hybna 
ization as well as gene-expression analysis. 

Fabrication of oligonucleotide chips by photoli. 
thography is theoretically simple but technically 
complex (29.30). The light from a high-intensi^ 
mercury lamp is directed through a photolitho. 
graphic mask onto the silica surface, resulting m 
deprotection of the terminal nucleotides in the illu. 
mmated regions. The entire chip is then reacted with 
the desired tree nucleotide, resulting in selected chain 
elongation. This process requires onlv 4n cvcles 
i where n = oligonucleotide length tn bases) to*$vn.. 
thesize a vast number of unique oligos. the total num- 
ber of which is limited only by the complexity of the 
photolithographic mask and the chip size \29.2 

Sample preparation involves the generation of 
double-stranded cDNA from cellular polvi a>* R\a 
followed by antisense RNA synthesis in an in vitro 
transcription reaction with biotinyiated or fluor- 
tagged nucleotides. The RNA probe is then frag- 
mented to facilitate hybridization. If the indirect 
visualization method is used, the chips are incubated 
with tluor-linked streptavidin <e.g., phycoervthrin) 
after hybridization 1 12.33|. The signal is detected with, 
a custom contocal scanner (34|. This method has 
been applied successfully to the mapping of genomic 
library clones (351, to de novo sequencing by hybrid* 
ization [28.36], and to evolutionary sequence com- 
parison of the BRCA1 gene |37|. In addition, 
mutations in the cystic fibrosis |38| and BRCA1 [39] 
gene products and polymorphisms in the human im- 
munodeficiency virus- 1 clade B protease gene {40| 
have been detected by this method. Oligonucleotide 
chips are also useful for expression monitoring >33| 
as has been demonstrated by the simultaneous evalu- 
ation of gene-expression patterns in nearly all open 
reading frames of the yeast strain S. mrevisiat (12|. 
More recently, oligonucleotide chips have been used 
to help identifv single nucleotide polymorphisms in 
the human Ul| and yeast |42| senomes. 

THE USE OF MICROARRAYS IN TOXICOLOGY 
Screening for Mechanism of Action 

The field of toxicology uses numerous in vivo 
model systems, including the rat. mouse, and rab- 
bit. to assess potential toxicity and these bioassays 
are the mainstay of toxicology testing. However, in 
the past several decades, a plethora of in vitro tech- • 
niques have been developed to measure toxicity, 
many of which measure toxicant-induced DNA dam- 
age. Examples of these assays include the Ames test 
the Syrian hamster embryo cell transformation as- 
say, micronucleus assays, measurements of sister 
chromatid exchange and unscheduled DNA synthe- 
sis, and many others. Fundamental to all of these 
methods is the fact that toxicity is often preceded 
by. and results in. alterations in gene expression. In 
many cases, these changes in gene expression are a 
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far more sensitive, characteristic, and measurable 
endpoiru than the toxicity itself. We therefore pro- 
pose that a method based on measurements of the 
. genome-wide gene expression pattern of an organ- 
ism after toxicant exposure is fundamentally inror- 
"m--i v e and complements the established methods 
described above. 

We are developing a method by which toxicants 
can be identified and their putative mechanisms of 
action determined by using toxicant-induced gene ex- 
pression profiles. In this method, in one or more de- 
fined model systems, dose and time-course parameters 
are established for a senes of toxicants within a given 
proton-pic class <e.g.. polycydic aromatic hydrocar- 
b*"is (PAHsm. Cells are then treated with these agents 
a: j fixed toxicity level (as measured by cell survival), 
RNA is harvested, and toxicant-induced gene expres- 
sion changes are assessed by hybridization to a cDNA 
microarray chip i Figure 1 1. We have developed a cus- 
* t5m DNA chip, called ToxChip vl.O, specifically for 
this purpose and will discuss it in more detail below. 
The changes in gene expression induced by the test 
agents in the model systems are analyzed, and the 
common set of changes unique to that class of toxi- 

.nts. termed a toxicant signature, is determined. 

This signature is derived by ranking across all ex- 
periments the gene-expression data based on rela- 

Control 
Population 



tive fold induction or suppression or genes in treated 
samples versus untreated controls and selecting rne 
most consistently different signals across the sampie 
set. A different signature may be established ror each 
prorotypic toxicant class. Once the signatures are de- 
termined, gene-expression profiles induced by un- 
known agents in these same model systems can then 
be compared with the established signatures. A match 
assigns a putative mechanism of action to the test 
compound. Figure 2 illustrates this signarure method 
for different types of oxidant stressors. PAHs. and 
peroxisome proliferators. In this example, the un- 
known compound in question had a gene-expres- 
sion profile similar to that of the oxidant stressors in 
the database. We anticipate that this general method 
will also reveal cross talk between different pathways 
induced by a single agent (e.g., reveal that a com- 
pound has both PAH-like and oxidant-like proper- 
ties), in the future, it may be necessary to distinguish 
very subtle differences between compounds within 
a very large sample set (e.g.. thousands of highly simi- 
lar structural isomers in a combinatorial chemistry 
library or peptide library). To generate these highly 
refined signatures, standard statistical clustering tech- 
niques or principal-component analysis can be used. 

For the studies outlined in Figure 2. we developed 
the custom cDNA microarray chip ToxChip vl.O. 

Treated 
Population 



| RNA Isolation J 

C >* V ^ . Reverse ^ «- Cy5 

Transcription 




DNA "Chip" 



7, 



TMix cDNAs and 
Apply to Array 



Hybridize Under 
Coverslip 




Figure 1. Simplified overview of the method for sample 
preparation and hybridization to cDNA mtcroarrayv For illus* 



tratrve purposes, samples derived from cell culture are depicted, 
althougn other sample types are amenable to this analysis. 
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J>£!2h* Scnemat,c . "Wmnon of the method for .den- 
tif tcation of a tox!canr$ mechanism of action. In this method 

£mrtVS„« S,0n , daM denved ,rom «"°«"« oT mod" J 
terns to known toxicants are analyzed, and a set of chan Q « 

IZ'£?'T C J° l 1 n of (termed th, ££S5 

signature) .s .dent.f.cd. As deo.cted. oxidant stressors p^od«e 

The 2090 human genes that comprise this subarrav 
were selected for their well-documented involve- 
mem m basic cellular processes as well as their re- 
sponses to different types of toxic insult. Included 
on this list are DNA replication and repair genes 
apoptosis genes, and genes responsive to PAHs and 
dioxtn-like compounds, peroxisome proliferators 
estrogenic compounds, and oxidant stress. Some of 
the other categories of genes include transcription 
ractors. oncogenes, tumor suppressor genes, cvclins 
kinases, pnosphatases. cell adhesion and motility 
genes, and homeobox genes. Also included in this 
group are 84 housekeeping genes, whose hybridiza- 
tion intensity is averaged and used for signal nor- 
malization of the other genes on the chip. To date 
very few toxicants have been shown to have appre^ 
enable effects on the expression of these housekeep- 
ing genes. However, this housekeeping list will be 
revised if new data warrant the addition or deletion 
of a particular gene. Table 1 contains a general de- 
scnption of some of the different classes of *enes 
that comprise ToxChip vl.O. 

When a toxicant signature is determined the 
genes within this signature are flagged within the 
database. When uncharacterized toxicants are then 
screened the data can be quickly reformatted so that 
blocks of genes representing the different signatures 
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are displayed [11]. This facilitates rapid, visual in- 
terpretation of data. We are also developing Tox- 
Chip v-2.0 and chips for other model svste-?s 
including rat. mouse. Xmopus. and veast. for use in 
toxicology studies. 

Animal Models in Toxicology Testing 

The toxicology community relies heavilv on the 
use ot animals as model sv stems ror toxicology test- 
-".=. Lnrorrunateiv. these assavs jre mnerentlv ex- 
pensive, require large numoers ot animals and takea 
long time to complete and analvze. Therefore, the 

, viSJS H St L mte 01 Envlronm ental Health Sciences 
(iNiEHS), the National Toxicology Program, and the 
toxicology community at large are committed to re- 
ducing the number of animals used, by developing 
more efficient and alternative testing methodologies. 
Although substantial progress has been made in the 

;» e » ??f nt of ^^"^e methods, bioassays are 
still used for testing endpoints such as neurotoxic- 
ity, immunotoxiciry, reproductive and developmen- - 
tal toxicology, and genetic toxicology. The rodent 
cancer bioassay is a particularly expensive and time- 
consuming assay, as it requires almost 4 yr. 1200 
animals and millions of dollars to execute and ana- 
lyze [43]. In vitro experiments of the type outlined 
in Figure 2 might provide evidence that an unknown 
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Table 1. ToxChip v1.0: A Human cDNA Microarray 
"Chip Designed to Detea Responses to Toxic insult 

No. c' genes 
6e*e category or 

Apcotosis 72 

DM- replication and repair 99 

Oxicative siresyreoe**omeostasis 90 

Peroxisome oroiiferator resoonsrve 22 

-Dio*in/?AH resoonsive 1 2 

Estrogen resoonsive 63 

Housetceeotng 34 

Oncogenes and tumor suooressor genes 76 

CeH-cyce control 51 

Transcription factors 1 3 1 

"Kinases 276 

P~ ;snatases 88 

Hr--->snock proteins 23 

Receptors 349 

Cytochrome P450s 30 

•This irst is tmenoea as a general cuiae The gene categories are not 
untoue. ano some genes are xsteo m muiitoie categories. 

agent is (or is not) responsible for eliciting a given 
biological response. This information would help to 
v» ; ect a bioassay more specifically suited to the agent 
1. question or perhaps suggest that a bioassay is not 
necessary, which would dramatically reduce cost, 
animal use. and time. 

The addition of microarray techniques to stan- 
dard bioassays may dramatically enhance the sen- 
sitivity and interpretability of the bioassay and 
possibly reduce its cost. Gene-expression signatures 
could be determined for various types of tissue-spe- 
cific toxicants, and new compounds could be 
' *-eened tor these characteristic signatures, provid- 
ing a rapid and sensitive in vivo test. Also, because 
gene expression is often exquisitely sensitive to low 
. doses of a toxicant, the combination of gene-expres- 
sion screening and the bioassay might allow the use 
of lower toxicant doses, which are more relevant to 
human exposure levels, and the use of fewer ani- 
mais. in adcition. sene-expression chances are nor- 
mally measured in nours or days, not in the months 
to years required for tumor development. Further- 
more, microarrays might be particularly useful for 
investigating the relationship between acute and 
chronic toxicity and identifying secondary effects 
of a given toxicant by-studying the relationship 
between the duration of exposure to a toxicant and 
the gene-expression profile produced. Thus, a bio- 
assay that incorporates gene-expression signatures 
with traditional endpoints might be substantially 
shorter, use more realistic dose regimens, and cost 
substantially less than the current assays do. 

These considerations are also relevant for branches 
of toxicology not related to human health and not 
using rodents as model systems, such as aquatic toxi-~ 
cology and plant pathology. Bioassays based on the 
flathead minnow, Daphnia, and Arabadopsis could 
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also be improved by the addition of microarrav an jiv . 
sis. The combination of microarravs witn tradition x 
bioassays might also be useful for investigating some 
of the more intractable problems in toxicology re- 
search, such as the effects of complex mixtures and 
the difficulties in cross-species extrapolation. 

Exposure Assessment. Environmental Monitoring, 
and Drug Safety 

The currently used methods for assessment of ex- 
posure to chemical toxicants are based on measure- 
ment of tissue toxin levels or on surrogate markers 
of toxicity, termed biomarkers (e.g.. peripheral blood 
levels of hepatic enzymes or DNA adducts). Because 
gene expression is a sensitive endpoint. gene expres- 
sion as measured with microarray technology may 
be useful as a new biomarker to more precisely iden- 
tify hazards and to assess exposure. Similarly, 
microarrays could be used in an environmental- 
monitoring capacity to measure the effect of poten- 
tial contaminants on the gene-expression profiles 
of resident organisms. In an analogous fashion, 
microarrays could be used to measure gene-expres- 
sion endpoints in subjects in clinical trials. The com- 
bination of these gene-expression data and more 
established toxic endpoints in these trials could be 
used to define highly precise surrogates of safety. 

Gene-expression profiles in samples from exposed 
individuals could be compared to the profiles of the 
same individuals before exposure. From this infor- 
mation, the nature of the toxic exposure can be de- 
termined or a relative clinical safety factor estimated. 
In the future it may also be possible to estimate not 
only the nature but the dose of the toxicant for a 
given exposure, based on relative gene-expression 
levels. This general approach may be particularly 
appropriate for occupational-health applications, in 
which unexposed and exposed samples from the 
same individuals may be obtainable. For example, 
a pilot study of gene expression in peripheral-blood 
Ivrnphocvtes or Polish coke-oven workers exposed 
:o PAHs iana manv otner compounds* is under con- 
sideration artfte MEHS. An important consideration 
for these types of studies is that gene expression can 
be affected by numerous factors, including diet, 
health, and personal habits. To reduce the effects 
of these confounding factors, it may be necessary 
to compare pools of control samples with pools of 
treated samples. In the future it may be possible to 
compare exposed sample sets to a national database 
of human-expression data, thus eliminating the 
need to provide an unexposed sample from the same 
individual. Efforts to develop such a national gene- 
expression database are currently under way (44,45). 
However, this national database approach will re- 
quire a better understanding of genome-wide gene 
expression across the highly diverse human popu- 
lation and of the effects of environmental factors 
on this expression. 



158 



Alleles. Oligo Arrays, and Toxicogenetics 

djseases of environmental origin [4^^^ 
of toxicology, termed toxicogenetics was recent 
developed tostudy the relatioLhfp ^ n 
vanabil.tvand tox.cant susceptibly, 
not the subject of this discussion, but it£ wonh 
while to note that the ability of oligonuSetSar 
rays to discriminate DNA molecules based7n S 1n R le 
base-pair differences makes these arravT SnioCeW 

i roiect to identify common sequence Dolvmor 
phisms in 200 genes thought to be involved Ten 

" r °r ema diSMSes (48) - In a P»°t study on the 
feasibihry of this application to the Environment 
Genome Project, oligonucleotide arravs 

approach promises to dramaticallv improve *ur un 

FUTURE PRIORITIES 
There are many issues that must be addressed b* 
tore the full potential of microarravs in 

tern selection, dose selection, and the temporal nl 
ture of gene expression. In other wordTKich' 
pec.es at what dose, and at what time do 
for toxicant-induced gene exDression* u Z 
samples are analyzed. 'how SSKj^jS 
expression between individuals, before and after fox"! 

One 0 f the most pressing issues for arrav scientists 

u£l* rnr^ ai ° n ° f 3 ™ imii P ubl * database 
luued ro the existing public daabasesi to serve as a 
repertory tor gene-expression data. This reTnVn^ 

researchers must be encouraged to submit their « 
pression data so that others may view STqS the 

.n?tn? fi ^ VMnade ,audable pro S re * develop 
mg the first generation of such a database f44 4Si T 

Iv^th^ll^ Mn . reCOgnition are n^ed to ana- 
•y» the data in such a public database. 

oStofT'"** of P««forms and meth- 

™£ZT attiy ^ ybtmzition * «*" improve 

patibUity between platforms. In addmo„ toe Se™* 
mfimte variety of experimental conditions under" 
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which data will be colieaed bv different bh« 
nes will make large-scale data anal vs" extreme rat0 " 
hcult To help circumvent n^^f* 

sho U ,d be established. These standards wouic - -7 
tate data entry into the national database aric 

Many issues remain to be resolved, but it is cl#,» 
that new molecular techniques such as m cr^!? 

form the basis for mbSSSSSS!^^^ 
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